Skip to main content

Matching Dirty Data – Yet Another Wheel - Anjanette Young and Jeff Sherwood


Topics c4l10_staging


This talk demonstrates one method of matching sets of MARC records that lack common unique identifiers and might contain slight differences in the matching fields. It will cover basic usage of several python tools. No large stack traces, just the comfort of pure python and basic computational algorithms in a step-by-step presentation on dealing with an old library task: matching dirty data. While much literature exists on matching/merging duplicate bibliographic records, most of this literature does not specify how to accomplish the task, just reports on the efficiency of the tools used to accomplish the task, often within a larger system such as an ILS.


Run time 20 minutes 31 seconds
Audio/Visual sound


Reviews

There are no reviews yet. Be the first one to write a review.
PEOPLE ALSO FOUND
Community Video
by Ryan Scherle
22
0
0
Community Video
10
0
0
Community Video
19
0
0
Community Video
6
0
0
Community Video
11
0
0
Community Video
23
0
0
Community Video
30
0
0